Accession Number |
TCMCG075C29368 |
gbkey |
CDS |
Protein Id |
XP_017985047.1 |
Location |
join(3230830..3231210,3231932..3232177,3232271..3232696,3232808..3233002,3233089..3233243,3233748..3234633) |
Gene |
LOC18586283 |
GeneID |
18586283 |
Organism |
Theobroma cacao |
|
|
Length |
762aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018129558.1 |
|
Definition |
PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp3 [Theobroma cacao] |
CDS: ATGGACAAAGATAAACCCGCAACCAAACGGTCTCGAGACCGAGACGACAAGCACCACCGTAGCCACCACCGCGACTCCCACCGGCGATCGGATCATAAGTCCTCCTCATCGCGCCGTGATGAACGCGAGAGATCGTTTGAGAGGGAGGGTTCGAGAGATAGGGCGGCGGAGAAGTATCGTGAGGGATCCTATGAGGAAGTTGAAGCGAAGAAGAAGAGAAAAGAGAGGGAAGAGAGTGAGGAGGGAGGAGAGAAGAGAGCTAAAGTTGGAGAAGGAAACCGAGAGGAAAAAAGAGAGAGGAGAAGATTTGGGGATAAAGTAAAAGAAGAAGAGGAAATTGAATTCTCCAATGCTGCTAATGGAGGTGAACCAGTTCAAAATGGTGCTGCTCAGGCATCATTACCAAGGACTGGTCACCCCCTTTCTACCAAGGTACCTTCCATTTCTACTGCTGAAAATAAAGCATATAGTATTACCGGATCTCATGAGGTTACTGGATCTAGTACAGATGGCTCCTCTGCTGCTGGGAAAAGTGGTGGAAATCTCTCTCTTGATGCCTTAGCCAAAGCTAAAAAGGCTTTACAAATGCAGAAAGAACTAGCGGAGAAACTGAAGAAAATACCTTCATTGAACAGAGGCCCTAGCTCCAGTTCAGGAGTGACTACTGGAACAGTTCAGGGACCAGCCTCATCAGTTACTTATGCTATTGCTAGTGGACCGTCAAGCTCAGCAGTCCTTCCTCCTACCTCTGTGGCAGCAGCTTCCGTGAAGCAACCTGCTGGTGGGATGGCTTCTGTTCCTGGCCTTGCATCAATACCCAATTTAGAAGCTGTTAAACGTGCTCAAGAGCTGGCTGCTAAGATGGGATTTCGCCAGGACCCTCAGTTTGCCCCTCTAATAAACTTGTTCCCTGGACAGGTGCAAACGGATGTTCCGGTTCCTCAGAAACCTACCAAGGCCCCTGTTCTCCGAGTTGATGCACTTGGTAGGGAAATTGATGAACATGGTAATATCATAAATGTGACTAAACCCAGTAATCTTAGCACGCTTAAGGTTAACATTAACAAGCAAAAGAAAGATGCATTCCAGATCCTTAAACCTGAGCTGGAGGTGGATCCAGAATCAAATCCACATTTTGATTCGAGGATGGGAATCAATAAGAATAAGCTTCTTAGACCAAAAAGGATGACATTTCAGTTTGTAGAGGAAGGAAAATGGTCTAAAGACGCTGAGATCATTAAACTAAAGAGTCAATTTGGAGAAGCAAAGGCAAAAGAGCTAAAGGCAAAGCAAGCACAATTGGCAAAAGCAAAGGCTGATATAAATCCAAATTTGATAGAGGTGTCAGAAAGAATTATAACTAAGGAGAAACCGAAAGACCCAATTCCTGAAATAGAGTGGTGGGATCTGCCTATTCTGGTGTCTGGTTCTTACGGTGACATTACTGATGGTGTGGTGAATGAAGATAAACTGAAGATGGAGAAGATTACCATTTATGTTGAACATCCTCGTCCAATTGAGCCTCCTGCTGAGCCAGCTCCTCCACCGCCTCAGCCCCTGAAGTTAACCAAGAAGGAGCAGAAGAAACTACGCACACAGCGACGCCTGGCCAGGGAAAAGGATAGACAGGAGATGATTAGACAAGGCCTGATAGAACCGCCCAAGCCAAAAGTTAAGTTGAGCAATTTAATGAAAGTTCTAGGCTCTGAAGCTACCCAAGATCCTACTAAGCTTGAAATGGAAATCCATAGTGCCGCTGCTGAGCGGGAACAGGCTCATGTAGACAGGAACATTGCTCGCAAGCTTACCCCTGCTGAACGACGTGAAAAGAAAGAGAAAAAGCTTTTTGATGACCCAAATACAGTGGAGACTATTGTTTCAGTTTACAAGATCAATGACCTCTCACATCCCAAGACACGCTTTAAAGTTGATGTTAATGCCCAAGAAAACCGTTTGACTGGTTGCACTGTGATTTCTGAGGGTATTAGTGTTGTA
GTTGTGGAAGGTGGAAGCAAATCCATTAAGAGGTATGGAAAACTTATGCTTAGGCGAATAAACTGGACTGAAGCTGTGAAAGAGGAAGACAAGGATGGAGATGAGGATGAAGAGAAACCTCCTAACAAGTGTGTGTTAGTTTGGCAAGGCAGCGTTGCCAAACCAAGTTTCAGTAAGTTCTCCGTCCATGAGTGCATCACTGAAGCGGCTGCAAAAAAGGTTTTTGCTGATGCTGGAGTGGCCCATTACTGGGACCTCGCGGTAAATTTCTCAGAAAATGAATTTGATTTTTGA |
Protein: MDKDKPATKRSRDRDDKHHRSHHRDSHRRSDHKSSSSRRDERERSFEREGSRDRAAEKYREGSYEEVEAKKKRKEREESEEGGEKRAKVGEGNREEKRERRRFGDKVKEEEEIEFSNAANGGEPVQNGAAQASLPRTGHPLSTKVPSISTAENKAYSITGSHEVTGSSTDGSSAAGKSGGNLSLDALAKAKKALQMQKELAEKLKKIPSLNRGPSSSSGVTTGTVQGPASSVTYAIASGPSSSAVLPPTSVAAASVKQPAGGMASVPGLASIPNLEAVKRAQELAAKMGFRQDPQFAPLINLFPGQVQTDVPVPQKPTKAPVLRVDALGREIDEHGNIINVTKPSNLSTLKVNINKQKKDAFQILKPELEVDPESNPHFDSRMGINKNKLLRPKRMTFQFVEEGKWSKDAEIIKLKSQFGEAKAKELKAKQAQLAKAKADINPNLIEVSERIITKEKPKDPIPEIEWWDLPILVSGSYGDITDGVVNEDKLKMEKITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLIEPPKPKVKLSNLMKVLGSEATQDPTKLEMEIHSAAAEREQAHVDRNIARKLTPAERREKKEKKLFDDPNTVETIVSVYKINDLSHPKTRFKVDVNAQENRLTGCTVISEGISVVVVEGGSKSIKRYGKLMLRRINWTEAVKEEDKDGDEDEEKPPNKCVLVWQGSVAKPSFSKFSVHECITEAAAKKVFADAGVAHYWDLAVNFSENEFDF |
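
The Location, Length, and CDS fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Location value is a plain forward-strand `join(...)` of `start..end` ranges with 1-based inclusive coordinates (no `complement(...)` and no partial markers such as `<` or `>`). The helper names `parse_join` and `spliced_length` are hypothetical and are not part of any library or of this database.

```python
import re

def parse_join(location: str) -> list[tuple[int, int]]:
    """Extract (start, end) pairs from a join(...) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments: list[tuple[int, int]]) -> int:
    """Total nucleotides across all segments (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in segments)

# Location value copied from the record above.
location = (
    "join(3230830..3231210,3231932..3232177,3232271..3232696,"
    "3232808..3233002,3233089..3233243,3233748..3234633)"
)

segments = parse_join(location)
cds_nt = spliced_length(segments)

# A complete CDS includes one stop codon, which is not counted in the protein length.
codons, remainder = divmod(cds_nt, 3)
protein_aa = codons - 1

print(len(segments), cds_nt, protein_aa, remainder)  # prints: 6 2289 762 0
```

Under these assumptions the six segments add up to 2289 nt, which is 763 codons including the stop codon, and that agrees with the 762aa value in the Length field.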